ON SMALLEST TRIANGLES 



Geoffrey Grimmett and Svante Janson 

Abstract. Pick n points independently at random in R 2 , according to a prescribed 
probability measure fi, and let A" < < . . . be the areas of the (™) triangles 
thus formed, in non-decreasing order. If fi is absolutely continuous with respect to 
Lebesgue measure, then, under weak conditions, the set {n a Af : i > 1} converges 
as n — > oo to a Poisson process with a constant intensity k((j.). This result, and 
related conclusions, are proved using standard arguments of Poisson approximation, 
and may be extended to functionals more general than the area of a triangle. It is 
proved in addition that, if fj, is the uniform probability measure on the region £>, then 
k (a0 2/|iS|, where \S\ denotes the area of S. Equality holds in that n(fi) = 2/\S\ if S 
is convex, and essentially only then. This work generalizes and extends considerably 
the conclusions of a recent paper of Jiang, Li, and Vitanyi. 



1. The problem 

Drop n points independently and uniformly at random into the unit square [0, l] 2 
of R 2 . Of the (gj triangles thus formed, let A" be the smallest area. It is proved 
in [10] that E(A™ ) lies between cin~ 3 and C2n~ 3 for certain positive real c± and 
C2- An association is proposed in [10] between this problem and the (much harder) 
Heilbronn problem of finding the largest value of A™ for deterministic arrangements 
of n points; see [3] and the references therein. A novel aspect of [10] is the use of 
Kolmogorov complexity. 

Our target in this paper is to present a considerable strengthening of the above 
result, and to extend it to more general measures than uniform measure on [0, l] 2 . 
This will be done using the now standard approach of Poisson approximation and 
the so-called Chen-Stein method, see [2]. From this point of view, the problem 
is a fairly simple exercise in probability theory, and does not of itself merit the 
intervention of concepts imported from complexity theory. 

In broad terms, our extensions are as follows. First, we shall prove for the above 
problem on the square that 72 3 A™ has, asymptotically, the exponential distribution 
with parameter 2 (that is, with mean ^). Furthermore, the sequence A™ < A2 < 
. . . of smallest areas of triangles, written in non-decreasing order, is such that 
n 3 {A™ , A2 , . . . } converges as n — ► 00 to a Poisson process with intensity 2. These 
weak-limit theorems are complemented by the convergence of expectations obtained 
via an exponential bound on the tails of the A.f. 
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Very similar results are valid when the underlying point distribution \i is uniform 
on a region S (C W 2 ) having non-zero finite Lebesgue measure \S\. In this case, 
the constant 2 is replaced by a constant n(p) satisfying k(p) < 2 / j iS' j - Furthermore, 
k(h) = 2/\S\ if S is convex, and more fully k(/j,) = 2/\S\ if and only if S differs 
from a convex set by a null set. Strict inequality holds, for example, if S is the 
closure of its interior and is non-convex. 

Indeed, corresponding results hold for a family of n points drawn from IR 2 ac- 
cording to a general probability measure \i having a density function / satisfying 
the boundedness conditions (Al) and (A2) below. In this very general situation, 
one obtains a formula for the ensuing constant k(/j,) in terms of /. Our results in 
this general setting are stated in Section 2. In Section 3, we set the scene by adapt- 
ing to the present problem some of the basic theorems of Poisson approximation 
(see [2]) and exponential bounds (see [9]). 

We return in Section 4 to the case of the uniform distribution on the unit square, 
where we give explicit proofs for this case. Special prominence is afforded to this 
instance since it has featured in [10], and since its resolution contains most of the 
basic ingredients of the general case. It is in addition useful to see the required 
arguments displayed in their simplicity without the intervention of certain minor 
but significant complications which arise in the setting of general measures. 

Proofs in the general setting may be found in Section 5, and some properties of 
the constant k{ji) are established in Section 6. 

Further extensions come readily to mind, but we have not worked through the 
details. For example, for random points in IR 3 , one might study the smallest area of 
a triangle formed by three points, or the smallest volume of a tetrahedron formed 
by four points, and similarly in higher dimensions. One might also study problems 
arising with points in other (Riemannian) manifolds, for example the smallest area 
of a geodetic triangle formed by random points on a torus or a sphere. Yet another 
possibility would be to allow fi to be singular, perhaps uniform measure on an arc 
of M 2 . The basic ingredients of such extensions are the arguments used in Section 
3. 

The work reported here is quite closely related to that of [14], where Poisson 
limit theorems were established for certain [/-statistics, examples of which include 
the number of short inter-point distances and the number of 'flat' triangles created 
by an independent family of points in R 2 with a common density function. 

Our triangle problem has a property which raises its visibility above that of 
some other applications of the approach, namely the following. We will see in the 
proofs of Sections 4 and 5 that, although the smallest triangle has small area (with 
order n -3 ), its diameter does not tend (stochastically) to as n — > oo, but remains 
of order 1. This fact underlies the observation that, in the case of the uniform 
distribution \x on a region S, the shape of S affects the value of the constant k(/j,). 

2. Notation and results 

Let ji be a probability measure on the Borel cr-field of M 2 , and let X, Xi, X 2 , . . . , X n 
be points chosen independently at random according to the measure fx. The n points 
Xi,X 2 ,...,X n form triangles, whose areas we write in non-decreasing order 
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as A™ < A£ < For a > 0, we shall be interested in the number T n (a) of such 

triangles with area not exceeding an -3 , which is to say that 

T n (a) = \{k:A n k <an- 3 }\. 

The target of this paper is to establish conditions on fx under which T n (a) has an 
approximate Poisson distribution, and more ge nerally that the set n 3 {A^, A£, . . . } 
is approximately distributed Poisson process. 

We assume throughout that \i is absolutely continuous with respect to Lebesgue 
measure, which is to say that there exists an integrable (density) function / : IR 2 — > 
[0, 00) such that n(A) = J A /(x) <ix for any Borel set A. There follow two conditions 
on /. 

(Al) The function f is bounded, in that there exists M < 00 such that /(x) < M 
for all xGK 2 . 

This condition will prevent the clustering of points. A further condition concerns 
certain marginals of /, and this is prefaced with some notation. For notational 
convenience, we identify M 2 with the complex plane, and we let L(r, 9) denote the 
line {re ie + ite %e : —00 < t < 00}. Note that L(r, 6>) = {x : x • (cos 9, sin6>) = 
r}, and that L(—r,9) = L(r,9 + n). The family of all lines is given by either 
{L{r,0) : < 9 < 2tt, r > 0}, where L(0, 9) is identified with L(0, 9 + tt), or as 
{L(r, 6>) : < 9 < 7r, —00 < r < 00}. We write /l for the integral of / along 
the line L, and denote fL(r,9) by f r ,e where convenient. Note that the mapping 
r 1— > f r: g is the density function of the projection re(X) = X ■ (cos sin 9) of X 
onto the line through the origin in the direction 9. We shall assume that all these 
marginal densities are uniformly bounded. 

(A2) There exists N < 00 such that, for all r, 9, f r ^ < N. 

In our first theorem, we identify a constant which will play an important role in 
what follows. We write |x| for the Euclidean norm of x (e M?). 

Theorem 2.1. Assume that (Al) and (A2) hold, and let X, Y be chosen indepen- 
dently according to [i. Let L{R, G) be the straight line passing through the points X 
and Y. The constant 

exists and satisfies < k < oo. Furthermore, 

(2.2) k = - / f^ e drd9. 

6 JO J -oo 

We are ready to state our main theorem. For accounts of vague and weak 
convergence, see [8], [11]. 

Theorem 2.2. Assume that (Al) and (A2) hold. 

(a) The set n 3 {A", /SQ , ■ ■ ■} converges vaguely to a Poisson process with con- 
stant intensity k = k(h), and the convergence is weak when restricted to any 
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bounded interval o/R. In particular, the random variable n A™ converges 
weakly, as n — > oo, to £/ie exponential distribution with parameter k. 

(b) If E(|X| 6 ) < oo /or some 5 > 0, £/ien positive moments o/n 3 A™ converge 
to the corresponding moments of the exponential distribution with parameter 
k. In particular, E(A") ~ l/(«n 3 ) asn-> oo. 

(c) There exist absolute positive constants A = A(/j,), B = B(fS) such that the 
total-variation distance between T n (a) and the Poisson distribution with 
parameter X = X n (a) = E(T n (ct)) satisfies 

lrrml \ „ ,^„, \ \ \ , 1 — e _A a 2 login 3 /a) 

d TY (T n {a), Po{X n {a))) < A M ' , 

a n 

for < a/n 3 < B and n > 3, where, for a > 0, X = X n (a) — > bo as 
n — > oo. 

The conclusion of part (b) is valid with replaced by A^ +1 — A™ for any given 
i. We shall return briefly to this point after the statement of Theorem 3.4. 

With X chosen according to fx, let V = V(fx) denote the covariance matrix of 
the 2-vector X, and write | V| for the determinant of V. If E(|X| 2 ) = oo, we write 
\V\ = oo. The constant k may be calculated numerically using the formula in (2.2), 
and there is a universal lower bound on the product k(/j) \V(/j) \ 1 / 2 . More important, 
there is an explicit formula for k valid whenever \i is uniform on a convex region. 
Prior to stating formally this last fact, we make a definition. A measurable subset 
S of M 2 is called essentially convex if there exists a set iV (C M 2 ) with Lebesgue 
measure such that the symmetric difference S A iV is convex. It is trivial that 
every convex set is essentially convex. 

Theorem 2.3. Assume that (Al) and (A2) hold. 

(a) The product k(h) \V{jj) I 1 / 2 is invariant under non-singular affine mappings of 
, and satisfies 

(2.3) k(/0W/0| 1/2 > ^ for all v. 

(b) Let S be a subset o/R 2 with non-zero finite Lebesgue measure \S\, and let \x be 
the uniform probability measure on S . Then 

(2-4) < ||| . 

Equality holds in (2.4) if and only if S is essentially convex. 

Proofs of the above results may be found in Sections 5 and 6. Here is an example 
of Theorem 2.3(a) in action. Suppose that \i is the standard bivariate normal 
distribution on R 2 . Using the rotational invariance of fx, we have that 

f a ~ —e~ r2 / 2 
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By Theorem 2.1, 



K 



2 r 

= -7T / 

3 7-c 



(2tt)3/2 3v /3- 

By Theorem 2.3(a), if fx is a non-degenerate bivariate normal distribution with 
covariance matrix V, then 



It is easy to see that, in contrast to (2.3), the quantity k(h) |V(/z) j 1 / 2 is not 
bounded above, even for distributions n with E(|X| 2 ) < oo. A simple example is 
given by the mixture of two symmetric bivariate normal distributions with covari- 
ance matrices / and a 2 1 where / is the identity matrix and < a < oo. We omit 
the calculations. 

Theorem 2.3(b) has a striking reformulation in the language of geometrical prob- 
ability. Let S be a region of M 2 having Lebesgue measure satisfying < \S\ < oo, 
and let X, Y be points chosen independently and uniformly from S according to the 
uniform probability measure on S. Let /(X, Y) be the one-dimensional Lebesgue 
measure (length) of the intersection with S of the doubly infinite straight line 
through X and Y. Note that, in the earlier notation, Jr^q = Z(X, Y)/|<S|. 

Corollary 2.4. We have that 

for all S with < \S\ < oo. Equality holds if and only if S is essentially convex. 

In the case of convex S, this exact calculation amounts to a reformulation of a 
classical result of M. Crofton in integral geometry, namely the following. Let l(r, 6) 
be the length of the intersection of L(r, 9) with a given convex region S. Crofton 
proved that 



/ / l(r,e) 3 drd0 = 3\S\ 2 . 

JO J -oo 



His claim may be found in [5], with ancillary references at [4], [13]. Some related 
methods and conclusions of Crofton are summarised in [7, Section 4.13]. 
We close this section with a remark concerning the integrals 

/•7T rCO 

/(p, S) = / / Z(r, e) p dr d6, for p > 0, 

JO J -oo 

where S is assumed convex and / is given as above. It is easily seen that 1(1, S) = 
7r\S\, and Crofton's formula is that 1(3) = 3 1 -S' | 2 . One may ask whether or not 
I(p, S) depends only on l^l for any other value of p. The answer is negative, and 
indeed I(p, S) is not even invariant under affine area-preserving maps of S. As an 
illustration, let S e be the rectangle [0,e _1 ] x [0, e]. By elementary estimates, as 
e -> 0, 

|0 far pe (1,3), 
[P ' e) ~* \ oo for p G (0, 1) U (3, oo). 
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3. Generalities 

In advance of proving the theorems of the last section, we collect together cer- 
tain sufficient conditions for Poisson-limit results to hold. We begin with the 
'one-dimensional' distribution of T n (ct), and follow Barbour and Eagleson [1] as 
interpreted in [2, Thm 2.N]. 

Let U, V, X, Y, Z be chosen independently from M 2 according to fi, and let 
A(X, Y, Z) be the area of the triangle with vertices X, Y, Z. We fix /3 > 0, and 
write 



(3.1) 



tt(/3)=P(A(X,Y,Z)</3), 
TTiGfl) = P(A(X, Y, Z) < p, A(X, U, V) < $), 
tt 2 (/3) = P(A(X, Y, Z) < p, A(X, Y, V) < (3) . 
We let a > and write 



X n (a) 



n 



n(an 3 ), 



the mean number of triangles with area not exceeding an 3 . The following inequal- 
ity will be useful. 

Lemma 3.1. We have that tt(/3) 2 < mi/3) < n 2 (f3). 

Proof. Let H be the perpendicular distance (counted non-negative) from Z to the 
doubly infinite straight line through X and Y, and write A = A(X, Y, Z). Since 
A = |-H"|X - Y|, we have that 

2/3 



(3.2) 



tt(/3) = E P [H < 



IX - Y| 



X 



and the Cauchy-Schwarz inequality yields 



E P H < 



2/3 



< E P [H < 



|X- Y| 

2/3 
IX - Y| 



X 



X 



7Tl(/3). 



Similarly, 



tti(/3) = E f E ^P 



< 



2/3 



< E E P [H < 



|X- Y| 

2/3 
IX- Y| 



X,Y 



X,Y 



X 



X 



7T 2 (/3), 



as required. 

For random variables A, B, we define the total-variation distance 
d TY (A, B) = sup|P(A G E) — ¥(B e E)\ 



□ 



E 



where the supremum is taken all events E. We write Po(7) for a random variable 
having the Poisson distribution with parameter 7. 
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Theorem 3.2. For a > and n>3, 

d TY {T n (a), Po(A" (a))) < i^l_^n 5 7r 2 , 
where A = A n (a) and 7r2 = ^(an -3 ). 

Proof. The set of areas of the Q) triangles formed by the points Xi, X2, . . . , X n is a 
dissociated family of random variables. We now apply [2, Thm 2.N] with (3 = an~ 3 . 
The upper bound follows via an application of Lemma 3.1. □ 

It is an immediate corollary that 

P(A? > an' 3 ) = P(T n (a) = 0) 

satisfies 

(3.3) |P(A? > an~ 3 ) - e" A "^ I < I^-^n 5 7r 2 , 

2A 

whence n 3 A™ has, asymptotically as n — > 00, the exponential distribution with 
parameter lim^^oo A n (a)/a, whenever: (a) this limit exists and is independent of 
a, and (b) the right-hand side of (3.3) tends to zero for every a > 0. We shall 
verify these conditions for suitable /j in the subsequent sections. 
We turn now to the set n 3 A = n 3 {A™ , Ag, . . . }. 

Theorem 3.3. Assume that there exist constants c, d G (0, 00) such that, for all 
a > 0: 

(a) X n (a) — > ca as n — > 00, 

(b) A n (a) < c'a /or a/Z n, and 

(c) n 5 TT2(an~ 3 ) ^0 as n — > 00. 

T/ien n 3 A n converges vaguely to a Poisson progress £ with constant intensity c, 
and, for all T > 0, the set n 3 A n fl [0, T] converges weakly to the process £ restricted 
to [0, T] . In particular, n 3 A" converges weakly to the exponential distribution with 
parameter c. 

Proof We apply Theorem 3.2 and Lemma 2.2 of [8] (see also [11]), with J the semi- 
ring of half-open intervals of the form (a, 6], a, b > 0. Note that, in the notation of 
[8], if U e 1 is such that U C [0, T] for some T, then 

¥(Xj EU)< ¥(Xj < T), 

and similarly for F(Xj £ U, Xk G C/'). Lemma 3.1 is used to bound ni by 712- □ 

In Sections 4 and 5, we shall verify conditions (a), (b), (c) of Theorem 3.3 under 
the conditions stated in Section 2. 

The above theorems will be used to prove weak convergence, but they can imply 
only little about the convergence of expectations. We concentrate here on the 
random variable n 3 A", noting that similar arguments are valid for other elements 
of the set n 3 A n . We shall require an estimate for the tail of n 3 A", in combination 
with the results above, in order to study its expectations. 
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Theorem 3.4. We have that 

P(n 3 A? > a) < exp (-A n («) 2 /M n (a)) for a > 

where 

M n (a) = max{4A n (aO,6n 5 7r 2 (mi" 3 )}. 

Proof. This is an application of Lemma 3.1, and Theorem 4 of [9]. □ 

A corresponding exponential bound for the tail of any given Af, fori > 1, 
follows from Theorem 10 of [9], and may be used in a similar way to that of the 
proof of Theorem 2.2(b) in order to obtain the convergence of the moments of A™ 
as n — > oo; see the remark after the statement of Theorem 2.2. 

4. Uniform distribution on the unit square 

This section contains an analysis of the special instance in which \i is the uniform 
measure on the unit square S = [0, l] 2 . We give special emphasis to this case for 
two reasons. First, it is the case considered in [10], and secondly, it is valuable to 
see an example of the required estimates where other minor but significant matters 
do not arise. 

We assume henceforth in this section that // is the uniform measure on the unit 
square 5, and we next identify the constant c for an application of Theorem 3.3. 

Theorem 4.1. Let X, Y be chosen independently according to fx, and let /(X, Y) 
be the Lebesgue measure (length) of the intersection with S of the doubly infinite 
straight line through X and Y. The constant 

(4.1) 2 E ^ X ' Y > 



3 Vl x -Y| 

exists and satisfies < k < oo. 

We shall see in the proof of Theorem 2.3 (see Section 6) that k = 2 for this 
special instance of fx. 

Proof. The basic estimate necessary for the proof is the following. There exists an 
absolute constant v (> 1) such that 

(4.2) P(|X- Y| < e|X) < z/e 2 for all e > and all X. 
For reasons which will emerge soon, we shall assume that 

(4.3) v > e. 
It follows that 

/•OO poo 

E(|X-Yr 1 )=/ P(|X- Yp 1 > u)du < / min{l,z/w- 2 }rfw = 2v^7. 
Jo Jo 

The diameter of S is \/2, and therefore, 

B <^K(|X-Y|-) = ^, 

as required. □ 

We next verify the conditions of Theorem 3.3 with c = k, obtaining thereby 
the Poisson-process limit for the set n 3 A n , as n — > oo. Conditions (a) and (b) are 
implied by the next lemma, proved below. 
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Lemma 4.2. We have that X n (a) < Ka for all a,n, and X n (a) — > b« as n — > oo, 
/or a// a > 0. 

We shall require an upper bound for ^{P) valid for small /?. 

Lemma 4.3. There exist absolute constants A, B, with B > 0, such that ^(P) < 
Ap 2 log ft' 1 for 13 <B. 

It is an immediate consequence of Lemma 4.3 that 



Ao? 

(4.4) 7r 2 (an~ 3 ) < — ~- log(n 3 /a) = 0(n~ 6 logn) for a > and large n, 

n b 

thus verifying condition (c) of Theorem 3.3. It follows that the set n 3 A n satisfies the 
conclusions of Theorem 3.3, namely the vague and weak convergence to a Poisson- 
process limit. In addition, Theorem 3.2 provides a bound tending to zero for the 
total- variation distance between T n (a) and the appropriate Poisson distribution. 
An extra ingredient is needed in order to prove convergence of expectations, as 
stated next. 

Theorem 4.4. For p = 1,2,..., the random variables {|n 3 A™| p : n > 3} are 
uniformly integrable, and therefore, as n — > oo, E(|n 3 A™| p ) — > p\c~ p , the pth 
moment of the exponential distribution with parameter c. 

The proof is given at the end of this section. 

Proof of Lemma 4-2. The letter L denotes a doubly infinite straight line of M 2 . For 
x, y G M 2 , x 7^ y, we write L(x, y) for the doubly-infinite straight line through x 
and y. For e > 0, we write <Se(x, y) for the closed strip containing all points whose 
Euclidean distance to L(x, y) does not exceed e. 

Let X, Y, Z be chosen independently according to n, and let H be the absolute 
value of the length of the perpendicular dropped from Z onto L(X, Y). We have 
that A = A(X, Y, Z) satisfies 



(4.5) P(A < | X,Y) =P ( H < ~" ; 



X- Y| 



X,Y 



= p(zg5 2j9/ | X -y|(X,Y)|x,y) 



4/3/(X,Y) 
~ IX-YI 



with equality if the event C(p) does not occur, where C((3) is the event that 
<^2^/|x-Y|(X, Y) contains one or more of the four corners of S, and 1e denotes 
the indicator function of an event E. Thus, 



1 



P(A </3)-E 



4/3/(x,y; 

IX-YI 



, 4 X, Y 
' X- Y 



C{(3) 



We have by the absolute continuity of ji that P(C(/?)) — > as f3 — > 0, and it follows 
by Theorem 4.1 and the monotone convergence theorem that 

(4.6) ^P(A < P - E ( ^ X, Y| ) aS ^ 1 °- 
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As an immediate consequence, we obtain that 

\ fn\^. A n. , , ..n 3 ck /4/(X, Y 

X n (a) = P(A < an~ 3 ) = (1 + o(l))— • — ?E ' 



3/ v ~ y v w/ 6 n 3 V IX- Y| 
k« as n — > oo. 



The uniform upper bound on X n (a) follows from (4.5). □ 
Proof of Lemma 4-3. We have by (4.2) and (4.5) that, for e > 0, 

(4.7) tt 2 (/3)=E(P(A</3|X,Y) 2 ) 

<v(^ i ( W( X > Y ) \\ \ 

< E I 1{|X-Y|< £ } + ^ | X - Y| J 1 {|X-Y|>e} I 



< z/e 2 + 32/3 2 E 
Now, by (4.2) and (4.3), 



l{|X-Y|>e} 

IX- YP 



(4.8) E ( 1 |X-YP > ) = r P (l X - Y l" 2>W ) rfw 

< / min{l, vu x } du 
Jo 

= i/ + v log(e~ 2 /u) < v log(e" 2 ), 

if z/e 2 < 1. We substitute this into (4.7) and minimize over e to obtain that there 
exist A = A(v) and B = B(v) > such that 

tt 2 (/3) < A/3 2 log/3" 1 for < B. □ 

Proof of Theorem 4-4- By a standard argument (see [7, Section 7.10]), it suffices to 
show that, for each p > 1, the set {E(|n 3 A™ | p ) : n > 3} is uniformly bounded. This 
will be achieved by the exponential tail estimate of Theorem 3.4, which we make 
more precise as follows. 

Consider the expression M n (ot) in Theorem 3.4. By Lemma 4.2 and equation 
(4-4), 

/ x . / c 2 a 2 log n 
M n la) < max < ci«, 

I n 

for positive constants Cj and n > 3. Therefore, 

cia it a < — 

M n (a) < 



cia if a < 

c 2 log n 

C2d 2 logn 



otherwise. 

n 
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Furthermore, by (4.6), P(A < (3) > c 3 (3 for < (3 < 1, and thus X n (a) > c±a for 
a < n 3 and n > 3. We apply Theorem 3.4 to obtain 



., cm 

it a < 



(4.9) P(n 3 A™ > a) < 



c 2 log n ' 



exp I — — - — I otherwise; 
logn / 



this probability is evidently for a > n 3 . Therefore, 

pern/ log n 

(4.10) E(\n 3 A™\ p )< pa^e-^da 

Jo 



n 3 



+ I pa p 1 exp ( — Ce,U ) da, 

Jcrn/logn V lo S n J 

which is bounded above uniformly in n. □ 



5. Proofs of Theorems 2.1 and 2.2 

We assume throughout this section that (Al) and (A2) hold, and begin with a 
preliminary lemma. As in Section 2, a line in R 2 is denoted L(r,9), and L(R,<d) 
denotes the line passing through two points X, Y chosen independently according 
to the measure fx. 

Lemma 5.1. Let g(r,9) be a non-negative measurable function on the set of lines 
L(r,6) in M 2 . Then 

E (jX^Yf) = [ fj Mf ^ drde - 



Proof. Using polar coordinates in the form Y = X + ire 10 , r G (— oo, oo), 9 G [0, ir), 
we find 



E 



IX- Yl 



7r roo 



X)=l I 9{Td{ ~ K) > d) f(X + ire* e )\r\drd9 



J-oc I' I 

9(re(X),9)f Te (x.),edff, 



o 



where 70 (X) = X ■ (cos6>, sin6>) as before. Hence, by Fubini's theorem, 

E (^Y[) = £ E (9(Te(X-),9)f Te{ x),e) d9 = £ £ g(r,9)f* e drd9, 
since rg(X.) has density function f r j. □ 
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Proof of Theorem 2.1. It is elementary that k exists and is non-negative. Further- 
more, 

K^EflX-YI- 1 ). 

The finiteness of E(|X - Y]" 1 ) follows just as in the proof of Theorem 4.1, or 
alternatively as follows. By Lemma 5.1, 

(5.1) E(|X-Y|- 1 )=/ / f* 9 drd6<N / f rfi dr d9 = Nir. 

JO J-oo ' JO J-oo 

Equation (2.2) follows from Lemma 5.1 with g(r, 9) = f r ^. □ 

Proof of Theorem 2.2. We show first that conditions (a), (b), (c) of Theorem 3.3 
are valid with c = k. Let A = A(X, Y, Z) where X, Y, Z are chosen independently 
according to (i, and let H be the length of the perpendicular dropped from Z onto 
L(R,0). Then 

pR+h 

(5.2) ¥(H < h | X, Y) = / / r , dr < 2hN. 

J R-h 

Consequently, by (3.2) and (5.1), 

n(P) < 4[3NE(\X - Yl" 1 ) < 4/3iV 2 7r, 

and (b) follows for suitable c'. 

For each 6>, the mapping r h- > f r ^ is integrable. Let Eg be the set of r (g K) 
such that 

i r +e 

2e J r _ e 

and note that, by the Lebesgue differentiation theorem (see [12], Theorem 9.3), the 
complement R\Eq has Lebesgue measure zero. We thus have 



(5.3) i? ( // < 2 " 



(3 V ~ l x ~ Y l 



1 /-fl+2/3/|X-Y| 

X,Y)=-/ / r ,,dr 

P </i?-2/3/|X-Y| 

4/i? ' e as p 0, 



IX - Yl 



whenever ^ i?e- 

For x, y G M 2 , x 7^ y, write L(r,9) for the line through x and y. Define u, 
v by x = (r + iu)e l6 ', y = (r + iv)e lB . The mapping (x, y) 1— > (r,9,u,v), with 
domain {(x, y) : x 7^ y} and range {(r, 9,u,v) : u 7^ v}, is differentiable with 
differentiable inverse. Thus it maps null sets to null sets. It follows that (5.3) 
holds for almost every (X, Y) G M 2 , and thus //-a.s. Together with (5.2) and the 
dominated convergence theorem, we deduce that 



\*(P) = (p (h < J ' :> 



(3 v J (3 V V ~ |X-Y 



X,Y] ) ^4E( ) as 010. 
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Consequently, for any a > 0, 
thus confirming (a). 

Finally, (c) follows as in the proof of (4.4) and Lemma 4.3. Theorem 2.2(a) 
follows. 

We turn now to Theorem 2.2(b). The proof of Theorem 4.4 may be followed until 
equation (4.9), which is valid subject to an upper bound ona/n 3 (with cq depending 
on the bound). Unlike the situation when fx has bounded support, there is in the 
general case no deterministic upper bound on the maximal area of a triangle. Thus 
we shall need a further estimate on the tail of A". Assume that 5 G (0, 1) is such 
that E(\X\ S ) < oo, and note that A = A(X, Y, Z) satisfies 

A < |X- Y| • |X-Z|, 

whence 

E(A 5/2 ) < ^E(|X-Y| 5 -E(|X-Z|*) = E(|X - Y\ s ) < 2E(|X| <5 ) < oo. 

By Markov's inequality, there exists to > 1 such that 

P(A >t)< t~ s/2 E(A s/2 ) < ±t~ s/3 for t > t . 

Therefore, by considering the [n/3\ triangles X!X 2 X 3 , X 4 X 5 X 6 , . . . with disjoint 
vertex sets, 

P(A? >t)< (8t 5 / 3 )-LV3j < 2 2-n t -ln/3}5/3^ f > ^ 

Let p > 1. It follows from the above that 

P(A? >t)< 2 2 ~ n t~ p - 1 for n > 3 + > t > t . 

o 

Hence, for n > 3 + 9(p + l)/S, 

/ pa p " 1 P(n 3 A^ > a) da = n 3p / pt p - 1 P(A^ > t) dt < n 3p ■ 2 2 ">. 

Jton 3 J to 

We combine this with (4.10), with the range of the second integral of that equation 
extended to ton 3 , to deduce as required that the pth moment of A" is uniformly 
bounded in n. This completes the proof of Theorem 2.2(b). 

Part (c) follows by Theorem 3.2 and the proof of Lemma 4.3. □ 

We remark that conditions (Al) and (A2) may be relaxed, but we do not attempt 
here to obtain the weakest necessary assumptions. We note as follows, however, 
that conditions (a) and (b) of Theorem 3.3, with c = k, hold whenever k < oo. 
Under the assumption k < oo, we have by (2.2) and [15, Thm 1], that the Hardy- 
Littlewood maximal function of f r; g is bounded in L 3 . It is now a simple matter to 
replace (5.2) in the above proof. 



R,@ 



IX- Y| 



a 



na as n 



oo, 
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6. Proof of Theorem 2.3 

Finally, we prove Theorem 2.3, beginning with part (a). Let A be a non-singular 
2x2 matrix, let b G K 2 , and write X,- = AKj + b. Denote by Ji the measure 
governing the Xj. Let the areas formed by triangles of the Xj be written Af, so that 
A™ = |A|A™. We have by Theorem 2.2 that n 3 A n converges to a Poisson process 
with intensity and thus k(J1) = The covariance matrix of Xi 

has determinant \V(J1)\ = \A\ 2 ■ |V(aO|, and thus k(JZ) \V(p) | 1/2 = k(^)|F(^)| 1/2 as 
required. 

We prove (2.3) next. Since the mapping r i— > f r j is a density function, 

71" />00 

/ f r jdrd9 = n. 

./-oo 

The Cauchy-Schwarz inequality yields 

(pit pOO \ 2 «7T «00 />7T /"OO q 

/ / fledrde) < / f r , e drd6 / f* e drd6 = ic— , 

by Theorem 2.1, whence, by (5.1), 

(6.1) AEdX-YI- 1 ) 2 . 

On the other hand, by Holder's inequality with exponents | and 3, 

1 = E ( lX-Vp/3 ' l X - Y l 2/3 ) S E (ix^Yf) E(|X - Y|2)1/3 ' 
and thus 

(6.2) E(|X- Yp 1 )" 2 <E(|X- Y| 2 ) = 2(v u + v 22 ), 

where V(pt) = (%•). If V(/z) = /, the identity matrix, (6.1) and (6.2) yield 

/ x 2 1 1 1 



3tt 2(^11 + ^22) 6tt 6tt|"1/(^)| 1 /2' 

and (2.3) is proved in this case. The general case follows since k(/i) \V(h) I 1 / 2 is 
invariant under non-singular affine maps of M 2 . 

We turn to part (b). First suppose that S is convex. For x G 5 and 6> G [0, 27r), 
let r(x, 0) = sup{t : x + te w G 5}. Then, for every x G S, 

(6.3) \S\= / pdpd9= §r(x,0) 2 e20, 

Jo Jo Jo 

and thus, by Fubini's theorem, 

|S| 2 = / / I r (x,0) 2 e2xc20. 
Jo Js 
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The line L(s,9) intersects S in a (possibly empty) interval of length l(s,9), say, 
and, for x in this interval, r(x, 9 + \n) varies linearly between and l(s, 9). Hence, 

p poo rl(s,6) poo 

/ r(x,6>+ ±7r) 2 dx = / / r 2 drds = ±l(s,9) 3 ds. 

JS " J -oo JO J -oo 

Consequently, 

/•2"7T pOO P7T POO 

\S\ 2 = l / l{s,9fdsd9=\ I / l(s,9) 3 dsd9, 

Jo J -oo Jo J -oo 

and thus, since f Sj g = l(s,9)/\S\, 

rf P7T POO rf 

K= sl L fl > dsde= m 

The same conclusion is easily seen to hold under the weaker assumption that S be 
essentially convex. 

For a general region S, let A(x, 9) = {t > : x + te i0 G S}. Then, as in (6.3), 

/•27T 

(6.4) \S\ > / ±\A(x,9)\ 2 d9, 

Jo 

and, as above (except that L(s, 9) fl S need not be an interval), 

/»2"7T /» /»7T pOO 

/ \\A{^9)\ 2 d^d9 =\ / |L(s,0) n^| 3 rfs^ 

JO is io J-oo 

p7T pOO 

= l\S\ 3 / fledsde, 

JO J-oo 

yielding as required that l^l 2 > IIS'I 3 ^. Equality holds only if there is equality in 
(6.4) for almost every x, that is, if A(x, 9), for almost every (x, 6*), differs only by 
a null set from an interval containing 0. 

Let T' denote the set of density points of a measurable subset T of M 2 (recall 
that x is a density point of T if \T fl -B(x, r) |/|£?(x, r) | — > 1 as r — > 0, where S(x, r) 
denotes the ball of radius r with centre x). It may be deduced from the above 
statement concerning the A(x, 9) that 5" is a convex set (this is left to the reader 
to prove) . The symmetric difference T A T' is a null set for any measurable T (see 
[15, Section 1.1] and [6, Thm 2.9.11]), and in particular iV = S A 5" is a null set. 
Thus 5 A iV is convex where iV is null. Hence S is essentially convex and the proof 
is complete. 
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